3 <110> APPLICANT: DING, SHI-YOU

4 ADNEY, WILLIAM S. 

5 VINZANT, TODD B.

6 DECKER, STEPHEN R. 

7 HIMMEL, MICHAEL E. 

9 <120> TITLE OF INVENTION: THERMAL TOLERANT CELLULASE FROM ACIDOTHERMUS 
10 CELLULOLYTICUS 
12 <130> FILE REFERENCE: 40170.6US01
C--> 14 <140> CURRENT APPLICATION NUMBER: US/09/917,383
15 <141> CURRENT FILING DATE: 2001-07-28 
17 <160> NUMBER OF SEQ ID NOS: 14
19 <170> SOFTWARE: PatentIn Ver. 2.1

21 <210> SEQ ID NO: 1 

22 <211> LENGTH: 1228 

23 <212> TYPE: PRT 

24 <213> ORGANISM: Artificial Sequence 

26 <220> FEATURE: 

27 <223> OTHER INFORMATION: Description of Artificial Sequence: Segment of 
28 GuxA
30 <400> SEQUENCE: 1 
65 180 185 190

67 Thr Ile Ile Glu Pro Asp Ser Leu Pro Asn Ala Val Thr Asn Met Ser

68 195 200 205 

70 Ile Gln Ala Cys Ala Thr Ala Val Pro Tyr Tyr Glu Gln Gly Ile Glu

71 210 215 220 

73 Tyr Ala Leu Thr Lys Leu His Ala Ile Pro Asn Val Tyr Ile Tyr Met

74 225 230 235 240 

76 Asp Ala Ala His Ser Gly Trp Leu Gly Trp Pro Asn Asn Ala Ser Gly 

77 245 250 255

79 Tyr Val Gln Glu Val Gln Lys Val Leu Asn Ala Ser Ile Gly Val Asn

80 260 265 270 

82 Gly Ile Asp Gly Phe Val Thr Asn Thr Ala Asn Tyr Thr Pro Leu Lys

83 275 280 285 

85 Glu Pro Phe Met Thr Ala Thr Gln Gln Val Gly Gly Gln Pro Val Glu

86 290 295 300 

88 Ser Ala Asn Phe Tyr Gln Trp Asn Pro Asp Ile Asp Glu Ala Asp Tyr

89 305 310 315 320 

91 Ala Val Asp Leu Tyr Ser Arg Leu Val Ala Ala Gly Phe Pro Ser Ser 

92 325 330 335 

94 Ile Gly Met Leu Ile Asp Thr Leu Arg Asn Gly Trp Gly Gly Pro Asn

95 340 345 350 

97 Glu Pro Thr Gly Pro Ser Thr Ala Thr Asp Val Asn Thr Phe Val Asn 

98 355 360 365 

100 Gln Ser Lys Ile Asp Leu Arg Gln His Arg Gly Leu Trp Cys Asn Gln

101 370 375 380 

103 Asn Gly Ala Gly Leu Gly Gln Pro Pro Gln Ala Ser Pro Thr Asp Phe

104 385 390 395 400 

106 Pro Asn Ala His Leu Asp Ala Tyr Val Trp Ile Lys Pro Pro Gly Glu

107 405 410 415 

109 Ser Asp Gly Thr Ser Ala Ala Ser Asp Pro Thr Thr Gly Lys Lys Ser 

110 420 425 430 

112 Asp Pro Met Cys Asp Pro Thr Tyr Thr Thr Ser Tyr Gly Val Leu Thr 

113 435 440 445 

115 Asn Ala Leu Pro Asn Ser Pro Ile Ala Gly Gln Trp Phe Pro Ala Gln

116 450 455 460 

118 Phe Asp Gln Leu Val Ala Asn Ala Arg Pro Ala Val Pro Thr Ser Thr

119 465 470 475 480 

121 Ser Ser Ser Pro Pro Pro Pro Pro Pro Ser Pro Ser Ala Ser Pro Ser 

122 485 490 495 

124 Pro Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser 

125 500 505 510 

127 Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser 

128 515 520 525 

130 Ser Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser 

131 530 535 540 

133 Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser Pro Ser Pro Ser Ser Ser 

134 545 550 555 560 

136 Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Pro Ser Ser Ser Pro Ser 

137 565 570 575 
212 965 970 975

214 Thr Thr Gly Gln Pro Asn Gly Thr Glu Ile Met Ile Trp Leu Asn Ser

215 980 985 990 

217 Arg Gly Gly Val Gln Pro Phe Gly Ser Gln Thr Ala Thr Gly Val Thr

218 995 1000 1005 

220 Val Ala Gly His Thr Trp Asn Val Trp Gln Gly Gln Gln Thr Ser Trp

221 1010 1015 1020 

223 Lys Ile Ile Ser Tyr Val Leu Thr Pro Gly Ala Thr Ser Ile Ser Asn

224 1025 1030 1035 1040 

226 Leu Asp Leu Lys Ala Ile Phe Ala Asp Ala Ala Ala Arg Gly Ser Leu

227 1045 1050 1055 

229 Asn Thr Ser Asp Tyr Leu Leu Asp Val Glu Ala Gly Phe Glu Ile Trp

230 1060 1065 1070 

232 Gln Gly Gly Gln Gly Leu Gly Ser Asn Ser Phe Ser Val Ser Val Thr

233 1075 1080 1085 

235 Ser Gly Thr Ser Ser Pro Thr Pro Ser Pro Ser Pro Thr Pro Thr Pro 

236 1090 1095 1100 

238 Ser Pro Thr Pro Thr Pro Ser Pro Ser Pro Thr Pro Ser Pro Ser Pro 

239 1105 1110 1115 1120 

241 Thr Ser Ser Pro Ser Ser Ser Gly Val Ala Cys Arg Ala Thr Tyr Val 

242 1125 1130 1135 

244 Val Asn Ser Asp Trp Gly Ser Gly Phe Thr Ala Thr Val Thr Val Thr 

245 1140 1145 1150 

247 Asn Thr Gly Ser Arg Ala Thr Asn Gly Trp Thr Val Ala Trp Ser Phe 

248 1155 1160 1165 

250 Gly Gly Asn Gln Thr Val Thr Asn Tyr Trp Asn Thr Ala Leu Thr Gln

251 1170 1175 1180 

253 Ser Gly Ala Ser Val Thr Ala Thr Asn Leu Ser Tyr Asn Asn Val Ile

254 1185 1190 1195 1200 

256 Gln Pro Gly Gln Ser Thr Thr Phe Gly Phe Asn Gly Ser Tyr Ser Gly

257 1205 1210 1215 

259 Thr Asn Ala Ala Pro Thr Leu Ser Cys Thr Ala Ser 

260 1220 1225 

263 <210> SEQ ID NO: 2

264 <211> LENGTH: 3687 

265 <212> TYPE: DNA 

266 <213> ORGANISM: Artificial Sequence 

268 <220> FEATURE: 

269 <223> OTHER INFORMATION: Description of Artificial Sequence: Segment of 

270 GuxA 

272 <400> SEQUENCE: 2 

273 atggagcgaa cccaacaatc cggacggaac tgcaggtacc agagaggaac gacacgaatg 60 

274 cccgccatct caaaacggct gcgagccggc gtcctcgccg gggcggtgag catcgcagcc 120 

275 tccatcgtgc cgctggcgat gcagcatcct gccatcgccg cgacgcacgt cgacaatccc 180 

276 tatgcgggag cgaccttctt cgtcaacccg tactgggcgc aagaagtaca gagcgaagcg 240 

277 gcgaaccaga ccaatgccac tctcgcagcg aaaatgcgcg tcgtttccac atattcgacg 300 

278 gccgtctgga tggaccgcat cgctgcgatc aacggcgtca acggcggacc cggcttgacg 360 

279 acatatctgg acgccgccct ctcccagcag cagggaacca cccctgaagt cattgagatt 420 

280 gtcatctacg atctgccggg acgcgactgc gcggcgctcg cctccaacgg cgaactgccc 480 
281 gctacggcag caggtttgca gacctatgaa acgcagtaca tcgatccgat tgcgagtatc 540

282 ctgagcaatc cgaagtactc cagcctgcgg atcgtgacga tcattgagcc ggactcgctg 600 

283 ccaaacgcgg tcaccaatat gagcattcaa gcgtgtgcaa cggcggtgcc gtattacgag 660 

284 caaggcatcg agtacgcgct cacgaaattg cacgccattc cgaacgtgta catctacatg 720 

285 gacgccgccc actccggctg gcttgggtgg cccaataatg ccagcggata cgtacaggaa 780 

286 gtccagaagg tcctcaacgc gagcatcggg gtcaacggca tcgacggctt cgtcaccaac 840 

287 acggcgaatt acacgccgtt gaaggagccg ttcatgaccg ccacccagca ggtcggcgga 900 

288 cagccggtgg agtcggcgaa tttctaccag tggaatcctg acatcgacga agccgactac 960 

289 gcggttgact tgtactcgcg gctcgtcgcc gctggctttc caagcagcat cggcatgctc 1020 

290 atcgacacct tacgcaacgg ttggggtggt ccgaacgaac caacaggccc gagcaccgcg 1080 

291 accgatgtca acaccttcgt caaccagtcg aagattgacc ttcggcagca ccgcggcctg 1140 

292 tggtgcaacc agaacggtgc gggcctcggc cagccgccgc aggcaagccc gacggacttc 1200 

293 ccgaacgcgc acctcgacgc gtatgtctgg atcaagccgc cgggtgagtc ggacggcaca 1260 

294 agcgctgcga gcgatccgac aactggcaag aagtcggacc ccatgtgcga cccgacgtac 1320 

295 acgacgtcgt acggggtact gaccaacgcg ttaccgaact ccccgatcgc cggccagtgg 1380 

296 ttcccggcgc agtttgacca gcttgtcgcg aacgcacggc cagcggtgcc gacgtcgacc 1440 
297 agctcgagcc cgccgcctcc gccgccgagt ccgtcggctt cgccgagtcc gagcccgagt 1500

298 ccgagcccga gcagctcgcc atcgccgtcg ccgtctccga gctcgagccc gtctccgtcg 1560 

299 ccgagcccga gtccgagccc gagtagctcg ccgtcgccgt ctccgagctc gagcccgtct 1620 

300 ccgtcgccga gcccgagtcc gagcccgagt agctcgccgt cgccgtctcc gagctcgagc 1680 

301 ccgtctccgt cgccgagccc gagtccgagc ccgagtagct cgccgtcgcc gtctccgacg 1740 

302 tcgtcgccgg tgtcgggtgg gctgaaggtg cagtacaaga acaatgattc ggcgccgggt 1800 

303 gataaccaga tcaaaccggg tctccagttg gtgaataccg ggtcgtcgtc ggtggatttg 1860 

304 tcgacggtga cggtgcggta ctggttcacc cgggatggtg ggtcgtcgac actggtgtac 1920 

305 aactgtgact gggcggcgat ggggtgtggg aatatccgcg cctcgttcgg ctcggtgaac 1980 

306 ccggcgacgc cgacggcgga cacctacctg cagttgtcgt tcactggtgg aacgttggcc 2040 

307 gctggtgggt cgacgggtga gattcaaaac cgggtgaata agagtgactg gtcgaatttc 2100 

308 accgagacca atgactactc gtatgggacg aacaccacct tccaggactg gacgaaggtg 2160 

309 acggtgtacg tcaacggcgt gttggtgtgg gggactgaac cgtccggcac cagccccagc 2220 

310 cccacaccat ccccgagccc gagcccgagc ccgagcccgg gtggggatgt gacgccgccg 2280 

311 agtgtgccga ccggcttggt ggtgacgggg gtgagtgggt cgtcggtgtc gttggcgtgg 2340 

312 aatgcgtcga cggataacgt gggggtggcg cattacaacg tgtaccgcaa cggggtgttg 2400 

313 gtgggccagc cgacggtgac ctcgttcacc gacacgggtt tggccgcggg aaccgcgtac 2460 

314 acctacacgg tggccgcggt ggacgctgcg ggtaacacct ccgccccatc cacccccgtc 2520 

315 accgccacca ccacgagtcc cagccccagc cccacgccga cggggaccac ggtcaccgac 2580 

316 tgcacgcccg gtcctaacca gaatggtgtg accagcgtgc agggcgacga ataccgggtg 2640 

317 cagaccaatg agtggaattc gtcggcccag cagtgcctca ccatcaatac cgcgaccggt 2700 

318 gcctggacgg tgagcactgc gaacttcagc ggtgggaccg gcggtgcgcc cgcgacgtat 2760 

319 ccgtcgatct acaagggctg ccactggggc aactgcacca cgaagaacgt cgggatgccg 2820 

320 attcagatca gtcagattgg ttcggctgtg acgtcgtgga gtacgacgca ggtgtcgtcg 2880 

321 ggcgcgtatg acgtggccta cgacatttgg acgaacagta ccccaacgac aaccggtcag 2940 

322 ccaaacggta ccgaaatcat gatttggctg aattcgcgtg gtggggtgca gccgttcggg 3000 

323 tcgcagacag cgacgggtgt gacggtcgct ggtcacacgt ggaatgtctg gcagggtcag 3060 

324 cagacctcgt ggaagattat ttcctacgtc ctgacccccg gtgcgacgtc gatcagtaat 3120 

325 ctggatttga aggcgatttt cgcggacgcc gcggcacgcg ggtcgctcaa cacctccgat 3180 

326 tacctgctcg acgttgaggc cgggtttgag atctggcaag gtggtcaggg cctgggcagc 3240 

327 aactcgttca gcgtctccgt gacgagcggc acgtccagcc cgacaccgag cccgagcccg 3300 

328 acgccgacac cgagcccgac gccgacaccg tctccgagcc cgaccccgtc gccgagtccg 3360 

329 accagctcgc cgtcgtcgtc gggtgtggcg tgccgggcga cgtatgtggt gaatagtgat 3420 